Overview

Dataset info

Number of variables 27
Number of observations 500137
Missing cells 274972 (2.0%)
Duplicate rows 0 (0.0%)
Total size in memory 396.0 MiB
Average record size in memory 830.3 B
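
The derived figures in this block can be cross-checked from the raw counts. The sketch below is a sanity check, not the report generator's own code. It assumes the missing-cell percentage is missing cells divided by rows times columns, and that the average record size is total memory divided by the number of rows.

```python
# Figures copied from the "Dataset info" block.
n_variables = 27
n_observations = 500_137
missing_cells = 274_972
total_size_mib = 396.0

# Assumed definition: share of missing cells among all cells.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumed definition: total memory in bytes divided by the number of rows.
avg_record_bytes = total_size_mib * 1024**2 / n_observations

print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the missing share rounds to 2.0%. The record size comes out near 830.2 B, slightly below the reported 830.3 B, which is consistent with 396.0 MiB being a rounded figure.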

Variable types

NUM 12
CAT 11
BOOL 4

Variables

CHANNEL
Categorical

Distinct count 4
Unique (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 3.8 MiB
Value Count Frequency (%)  
T 280054 56.0%
 
R 219648 43.9%
 
C 272 0.1%
 
B 163 < 0.1%
 

Composition

Contains chars True
Contains digits False
Contains whitespace False
Contains non-words False

Length

Max length 1
Mean length 1
Min length 1

CREDIT_SCORE
Real number (ℝ≥0)

Distinct count 391
Unique (%) 0.1%
Missing 2711
Missing (%) 0.5%
Infinite 0
Infinite (%) 0.0%
Mean 712.5362124
Minimum 300
Maximum 839
Zeros 0
Zeros (%) 0.0%
Memory size 3.8 MiB

Quantile statistics

Minimum 300
5-th percentile 620
Q1 676
median 719
Q3 756
95-th percentile 788
Maximum 839
Range 539
Interquartile range (IQR) 80

Descriptive statistics

Standard deviation 54.79126197
Coefficient of variation (CV) 0.0768961086
Kurtosis 2.801586688
Mean 712.5362124
Median Absolute Deviation (MAD) 44.36682138
Skewness -0.891283879
Sum 354434038
Variance 3002.082389
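
Several of the CREDIT_SCORE statistics follow from one another, which gives a quick consistency check on the block. The sketch below assumes the mean is taken over non-missing observations only, and uses the standard definitions CV = standard deviation / mean, variance = standard deviation squared, and IQR = Q3 - Q1.

```python
# Values copied from the CREDIT_SCORE block.
n_rows, n_missing = 500_137, 2_711
total = 354_434_038          # "Sum"
std = 54.79126197            # "Standard deviation"
q1, q3 = 676, 756
minimum, maximum = 300, 839

n_valid = n_rows - n_missing  # observations with a credit score
mean = total / n_valid        # assumed: mean over non-missing values
cv = std / mean               # coefficient of variation
variance = std ** 2
iqr = q3 - q1
value_range = maximum - minimum

print(mean, cv, variance, iqr, value_range)
```

The results agree with the reported mean, CV, variance, IQR (80), and range (539) to within rounding.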
Histogram
Histogram with fixed size bins (bins=10)
Value Count Frequency (%)  
748 3881 0.8%
 
756 3870 0.8%
 
754 3826 0.8%
 
766 3772 0.8%
 
764 3757 0.8%
 
747 3757 0.8%
 
734 3749 0.7%
 
760 3674 0.7%
 
753 3664 0.7%
 
745 3629 0.7%
 
Other values (380) 459847 91.9%
 
Value Count Frequency (%)  
300 511 0.1%
 
333 1 < 0.1%
 
359 1 < 0.1%
 
363 1 < 0.1%
 
366 1 < 0.1%
 
Value Count Frequency (%)  
839 1 < 0.1%
 
838 2 < 0.1%
 
837 2 < 0.1%
 
835 1 < 0.1%
 
832 1 < 0.1%
 

DELINQUENT
Boolean

Distinct count 2
Unique (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 488.5 KiB
Value Count Frequency (%)  
False 482146 96.4%
 
True 17991 3.6%
 

FIRST_PAYMENT_DATE
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count 73
Unique (%) < 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 200025.431
Minimum 199901
Maximum 201103
Zeros 0
Zeros (%) 0.0%
Memory size 3.8 MiB

Quantile statistics

Minimum 199901
5-th percentile 199903
Q1 199904
median 200005
Q3 200105
95-th percentile 200203
Maximum 201103
Range 1202
Interquartile range (IQR) 201

Descriptive statistics

Standard deviation 109.8155414
Coefficient of variation (CV) 0.0005490078981
Kurtosis -1.400368035
Mean 200025.431
Median Absolute Deviation (MAD) 100.6621303
Skewness 0.1487456108
Sum 1.00040119e+11
Variance 12059.45314
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[199901. 199901.5 199902.5 199904.5 199905.5 ... 200309.5 200311. 200401.5 200408. 201103. ], "bayesian blocks" binning strategy used)
Value Count Frequency (%)  
200105 72893 14.6%
 
199905 67536 13.5%
 
199903 62459 12.5%
 
199904 62279 12.5%
 
200104 57531 11.5%
 
200203 44289 8.9%
 
200103 40800 8.2%
 
200005 23578 4.7%
 
200004 20218 4.0%
 
200003 19070 3.8%
 
Other values (63) 29484 5.9%
 
Value Count Frequency (%)  
199901 8 < 0.1%
 
199902 1473 0.3%
 
199903 62459 12.5%
 
199904 62279 12.5%
 
199905 67536 13.5%
 
Value Count Frequency (%)  
201103 1 < 0.1%
 
200701 1 < 0.1%
 
200604 1 < 0.1%
 
200505 2 < 0.1%
 
200503 2 < 0.1%
 

FIRST_TIME_HOMEBUYER_FLAG
Boolean

MISSING
Distinct count 3
Unique (%) < 0.1%
Missing 130559
Missing (%) 26.1%
Memory size 3.8 MiB
Value Count Frequency (%)  
N 320418 64.1%
 
Y 49160 9.8%
 
(Missing) 130559 26.1%
 

LOAN_PURPOSE
Categorical

Distinct count 3
Unique (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 3.8 MiB
Value Count Frequency (%)  
P 214791 42.9%
 
N 174293 34.8%
 
C 111053 22.2%
 

Composition

Contains chars True
Contains digits False
Contains whitespace False
Contains non-words False

Length

Max length 1
Mean length 1
Min length 1

LOAN_SEQUENCE_NUMBER
Categorical

UNIQUE
HIGH CARDINALITY
Distinct count 500137
Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Memory size 3.8 MiB
Value Count Frequency (%)  
F101Q1123130 1 < 0.1%
 
F199Q1169655 1 < 0.1%
 
F199Q1044484 1 < 0.1%
 
F199Q1317097 1 < 0.1%
 
F100Q1041580 1 < 0.1%
 
F199Q1161220 1 < 0.1%
 
F199Q1335685 1 < 0.1%
 
F102Q1091349 1 < 0.1%
 
F101Q1206245 1 < 0.1%
 
F101Q1170643 1 < 0.1%
 
Other values (500127) 500127 > 99.9%
 

Composition

Contains chars True
Contains digits True
Contains whitespace False
Contains non-words False

Length

Max length 12
Mean length 12
Min length 12

MATURITY_DATE
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count 122
Unique (%) < 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 203023.1959
Minimum 202402
Maximum 204101
Zeros 0
Zeros (%) 0.0%
Memory size 3.8 MiB

Quantile statistics

Minimum 202402
5-th percentile 202902
Q1 202903
median 203004
Q3 203104
95-th percentile 203202
Maximum 204101
Range 1699
Interquartile range (IQR) 201

Descriptive statistics

Standard deviation 110.3841886
Coefficient of variation (CV) 0.0005437023493
Kurtosis -1.163418093
Mean 203023.1959
Median Absolute Deviation (MAD) 100.6100265
Skewness 0.09022322708
Sum 1.015394121e+11
Variance 12184.66908
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[202402. 202408.5 202411.5 202501.5 202504.5 ... 203211.5 203301.5 203304.5 203558. 204101. ], "bayesian blocks" binning strategy used)
Value Count Frequency (%)  
203104 72734 14.5%
 
202904 67591 13.5%
 
202902 62308 12.5%
 
202903 62161 12.4%
 
203103 57136 11.4%
 
203202 44254 8.8%
 
203102 40558 8.1%
 
203004 24252 4.8%
 
203003 20652 4.1%
 
203002 19382 3.9%
 
Other values (112) 29109 5.8%
 
Value Count Frequency (%)  
202402 1 < 0.1%
 
202403 2 < 0.1%
 
202404 2 < 0.1%
 
202405 4 < 0.1%
 
202406 2 < 0.1%
 
Value Count Frequency (%)  
204101 1 < 0.1%
 
203612 1 < 0.1%
 
203504 1 < 0.1%
 
203502 2 < 0.1%
 
203406 1 < 0.1%
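
FIRST_PAYMENT_DATE and MATURITY_DATE are profiled as real numbers, but their values (for example 199904 and 202903) look like year-month stamps. The sketch below assumes a YYYYMM encoding and checks it against rows from this report's sample tables. The helper name `months_inclusive` is hypothetical.

```python
def months_inclusive(first_payment: int, maturity: int) -> int:
    """Count monthly payments between two YYYYMM integers, both ends included."""
    first_year, first_month = divmod(first_payment, 100)
    last_year, last_month = divmod(maturity, 100)
    return (last_year - first_year) * 12 + (last_month - first_month) + 1

# (FIRST_PAYMENT_DATE, MATURITY_DATE, ORIGINAL_LOAN_TERM) from the sample rows.
sample = [
    (200206, 202901, 320),
    (199904, 202903, 360),
    (200208, 202902, 319),
    (200203, 203202, 360),
]
for first, last, term in sample:
    print(first, last, term, months_inclusive(first, last))
```

For these rows the month count matches ORIGINAL_LOAN_TERM. If the YYYYMM reading is right, quantities such as the range of 1699 reported above are differences of encoded integers, not numbers of months.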
 

METROPOLITAN_STATISTICAL_AREA
Real number (ℝ≥0)

MISSING
Distinct count 391
Unique (%) 0.1%
Missing 70149
Missing (%) 14.0%
Infinite 0
Infinite (%) 0.0%
Mean 30777.82474
Minimum 10180
Maximum 49740
Zeros 0
Zeros (%) 0.0%
Memory size 3.8 MiB

Quantile statistics

Minimum 10180
5-th percentile 12420
Q1 19740
median 33340
Q3 40420
95-th percentile 47644
Maximum 49740
Range 39560
Interquartile range (IQR) 20680

Descriptive statistics

Standard deviation 11333.40114
Coefficient of variation (CV) 0.3682326883
Kurtosis -1.267260635
Mean 30777.82474
Median Absolute Deviation (MAD) 9961.585815
Skewness -0.2019569847
Sum 1.32340953e+10
Variance 128445981.5
Histogram
Histogram with fixed size bins (bins=10)
Value Count Frequency (%)  
16974 17051 3.4%
 
31084 12933 2.6%
 
12060 11603 2.3%
 
38060 11039 2.2%
 
33460 10773 2.2%
 
19740 9820 2.0%
 
47644 9763 2.0%
 
47894 8595 1.7%
 
42044 7788 1.6%
 
41740 7376 1.5%
 
Other values (380) 323247 64.6%
 
(Missing) 70149 14.0%
 
Value Count Frequency (%)  
10180 49 < 0.1%
 
10380 1 < 0.1%
 
10420 1193 0.2%
 
10500 93 < 0.1%
 
10580 477 0.1%
 
Value Count Frequency (%)  
49740 184 < 0.1%
 
49700 196 < 0.1%
 
49660 515 0.1%
 
49620 704 0.1%
 
49420 327 0.1%
 

MORTGAGE_INSURANCE_PERCENTAGE
Real number (ℝ≥0)

MISSING
ZEROS
Distinct count 41
Unique (%) < 0.1%
Missing 51048
Missing (%) 10.2%
Infinite 0
Infinite (%) 0.0%
Mean 7.744531708
Minimum 0
Maximum 55
Zeros 309979
Zeros (%) 62.0%
Memory size 3.8 MiB

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
median 0
Q3 18
95-th percentile 30
Maximum 55
Range 55
Interquartile range (IQR) 18

Descriptive statistics

Standard deviation 12.04654597
Coefficient of variation (CV) 1.555490561
Kurtosis -0.7749831851
Mean 7.744531708
Median Absolute Deviation (MAD) 10.69273212
Skewness 1.025832107
Sum 3477984
Variance 145.1192698
Histogram
Histogram with fixed size bins (bins=10)
Value Count Frequency (%)  
0 309979 62.0%
 
30 53985 10.8%
 
25 53585 10.7%
 
12 16365 3.3%
 
17 5397 1.1%
 
18 4279 0.9%
 
35 1518 0.3%
 
20 722 0.1%
 
36 556 0.1%
 
29 509 0.1%
 
Other values (30) 2194 0.4%
 
(Missing) 51048 10.2%
 
Value Count Frequency (%)  
0 309979 62.0%
 
1 6 < 0.1%
 
5 1 < 0.1%
 
6 177 < 0.1%
 
8 1 < 0.1%
 
Value Count Frequency (%)  
55 2 < 0.1%
 
53 1 < 0.1%
 
52 2 < 0.1%
 
50 2 < 0.1%
 
47 2 < 0.1%
 

NUMBER_OF_BORROWERS
Categorical

Distinct count 3
Unique (%) < 0.1%
Missing 247
Missing (%) < 0.1%
Memory size 3.8 MiB
Value Count Frequency (%)  
2 315078 63.0%
 
1 184812 37.0%
 
(Missing) 247 < 0.1%
 

Composition

Contains chars True
Contains digits True
Contains whitespace False
Contains non-words True

Length

Max length 3
Mean length 3
Min length 3
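
A fixed length of 3 together with "Contains non-words True" suggests the counts are stored as float-formatted strings such as "2.0", which matches the sample rows. The sketch below shows one way to recover integer counts while keeping missing values. The helper `to_int_or_none` and the raw cell values are hypothetical.

```python
from typing import Optional

def to_int_or_none(raw: Optional[str]) -> Optional[int]:
    """Convert a float-formatted count such as "2.0" to an int; missing stays None."""
    if raw is None or raw.strip() == "" or raw.strip().lower() == "nan":
        return None
    value = float(raw)
    if not value.is_integer():
        raise ValueError(f"expected a whole number, got {raw!r}")
    return int(value)

raw_values = ["2.0", "1.0", None, "nan"]  # hypothetical raw cells
borrowers = [to_int_or_none(v) for v in raw_values]
print(borrowers)
```

Rejecting non-integral values, rather than truncating them, keeps unexpected inputs visible.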

NUMBER_OF_UNITS
Categorical

Distinct count 5
Unique (%) < 0.1%
Missing 3
Missing (%) < 0.1%
Memory size 3.8 MiB
Value Count Frequency (%)  
1 489352 97.8%
 
2 8359 1.7%
 
4 1244 0.2%
 
3 1179 0.2%
 
(Missing) 3 < 0.1%
 

Composition

Contains chars True
Contains digits True
Contains whitespace False
Contains non-words True

Length

Max length 3
Mean length 3
Min length 3

OCCUPANCY_STATUS
Categorical

Distinct count 3
Unique (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 3.8 MiB
Value Count Frequency (%)  
O 465817 93.1%
 
I 20109 4.0%
 
S 14211 2.8%
 

Composition

Contains chars True
Contains digits False
Contains whitespace False
Contains non-words False

Length

Max length 1
Mean length 1
Min length 1

ORIGINAL_COMBINED_LOAN_TO_VALUE
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count 116
Unique (%) < 0.1%
Missing 13
Missing (%) < 0.1%
Infinite 0
Infinite (%) 0.0%
Mean 76.05357071
Minimum 6
Maximum 180
Zeros 0
Zeros (%) 0.0%
Memory size 3.8 MiB

Quantile statistics

Minimum 6
5-th percentile 45
Q1 70
median 80
Q3 88
95-th percentile 95
Maximum 180
Range 174
Interquartile range (IQR) 18

Descriptive statistics

Standard deviation 15.13998605
Coefficient of variation (CV) 0.1990700227
Kurtosis 1.458289543
Mean 76.05357071
Median Absolute Deviation (MAD) 11.27574618
Skewness -1.117783997
Sum 38036216
Variance 229.2191775
Histogram
Histogram with fixed size bins (bins=10)
Value Count Frequency (%)  
80 112011 22.4%
 
95 55449 11.1%
 
90 43065 8.6%
 
75 26192 5.2%
 
79 14931 3.0%
 
78 11732 2.3%
 
77 10213 2.0%
 
74 10111 2.0%
 
70 10104 2.0%
 
85 9271 1.9%
 
Other values (105) 197045 39.4%
 
Value Count Frequency (%)  
6 12 < 0.1%
 
7 19 < 0.1%
 
8 35 < 0.1%
 
9 29 < 0.1%
 
10 43 < 0.1%
 
Value Count Frequency (%)  
180 1 < 0.1%
 
175 1 < 0.1%
 
160 12 < 0.1%
 
159 1 < 0.1%
 
156 1 < 0.1%
 

ORIGINAL_DEBT_TO_INCOME_RATIO
Real number (ℝ≥0)

MISSING
Distinct count 66
Unique (%) < 0.1%
Missing 14929
Missing (%) 3.0%
Infinite 0
Infinite (%) 0.0%
Mean 32.91754052
Minimum 1
Maximum 65
Zeros 0
Zeros (%) 0.0%
Memory size 3.8 MiB

Quantile statistics

Minimum 1
5-th percentile 15
Q1 25
median 33
Q3 41
95-th percentile 51
Maximum 65
Range 64
Interquartile range (IQR) 16

Descriptive statistics

Standard deviation 11.11179999
Coefficient of variation (CV) 0.3375647093
Kurtosis -0.2547628592
Mean 32.91754052
Median Absolute Deviation (MAD) 8.974398479
Skewness 0.06650457841
Sum 15971854
Variance 123.4720991
Histogram
Histogram with fixed size bins (bins=10)
Value Count Frequency (%)  
28 20186 4.0%
 
36 16692 3.3%
 
33 16398 3.3%
 
35 16323 3.3%
 
34 16292 3.3%
 
32 16021 3.2%
 
37 15988 3.2%
 
31 15783 3.2%
 
38 15714 3.1%
 
30 15576 3.1%
 
Other values (55) 320235 64.0%
 
Value Count Frequency (%)  
1 117 < 0.1%
 
2 216 < 0.1%
 
3 363 0.1%
 
4 447 0.1%
 
5 574 0.1%
 
Value Count Frequency (%)  
65 524 0.1%
 
64 572 0.1%
 
63 675 0.1%
 
62 662 0.1%
 
61 768 0.2%
 

ORIGINAL_INTEREST_RATE
Real number (ℝ≥0)

Distinct count 472
Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 7.182686864
Minimum 4.625
Maximum 11.5
Zeros 0
Zeros (%) 0.0%
Memory size 3.8 MiB

Quantile statistics

Minimum 4.625
5-th percentile 6.5
Q1 6.875
median 7
Q3 7.375
95-th percentile 8.49
Maximum 11.5
Range 6.875
Interquartile range (IQR) 0.5

Descriptive statistics

Standard deviation 0.5799408624
Coefficient of variation (CV) 0.08074149318
Kurtosis 1.498757734
Mean 7.182686864
Median Absolute Deviation (MAD) 0.4382486878
Skewness 1.224618691
Sum 3592327.46
Variance 0.3363314039
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 4.625 4.9375 5.4 5.4975 5.51 ... 9.995 10.0625 10.5625 10.8 11.5 ], "bayesian blocks" binning strategy used)
Value Count Frequency (%)  
6.875 88342 17.7%
 
7 62799 12.6%
 
6.75 56918 11.4%
 
7.125 43344 8.7%
 
7.25 42294 8.5%
 
6.625 27526 5.5%
 
7.375 27106 5.4%
 
6.5 21847 4.4%
 
7.5 17728 3.5%
 
8.25 12304 2.5%
 
Other values (462) 99929 20.0%
 
Value Count Frequency (%)  
4.625 1 < 0.1%
 
4.73 1 < 0.1%
 
4.75 2 < 0.1%
 
4.875 6 < 0.1%
 
5 20 < 0.1%
 
Value Count Frequency (%)  
11.5 1 < 0.1%
 
10.875 2 < 0.1%
 
10.85 1 < 0.1%
 
10.75 5 < 0.1%
 
10.625 8 < 0.1%
 

ORIGINAL_LOAN_TERM
Real number (ℝ≥0)

Distinct count 62
Unique (%) < 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 359.8554696
Minimum 301
Maximum 362
Zeros 0
Zeros (%) 0.0%
Memory size 3.8 MiB

Quantile statistics

Minimum 301
5-th percentile 360
Q1 360
median 360
Q3 360
95-th percentile 360
Maximum 362
Range 61
Interquartile range (IQR) 0

Descriptive statistics

Standard deviation 1.90825071
Coefficient of variation (CV) 0.005302825361
Kurtosis 366.4589173
Mean 359.8554696
Median Absolute Deviation (MAD) 0.2863856088
Skewness -17.67165681
Sum 179977035
Variance 3.641420774
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[301. 304.5 311.5 312.5 323.5 ... 356.5 358.5 359.5 360.5 362. ], "bayesian blocks" binning strategy used)
Value Count Frequency (%)  
360 495446 99.1%
 
354 607 0.1%
 
348 405 0.1%
 
349 282 0.1%
 
336 209 < 0.1%
 
350 208 < 0.1%
 
353 204 < 0.1%
 
359 194 < 0.1%
 
351 191 < 0.1%
 
352 163 < 0.1%
 
Other values (52) 2228 0.4%
 
Value Count Frequency (%)  
301 6 < 0.1%
 
302 6 < 0.1%
 
303 4 < 0.1%
 
304 5 < 0.1%
 
305 10 < 0.1%
 
Value Count Frequency (%)  
362 1 < 0.1%
 
361 6 < 0.1%
 
360 495446 99.1%
 
359 194 < 0.1%
 
358 99 < 0.1%
 

ORIGINAL_LOAN_TO_VALUE
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count 96
Unique (%) < 0.1%
Missing 9
Missing (%) < 0.1%
Infinite 0
Infinite (%) 0.0%
Mean 75.71071406
Minimum 6
Maximum 100
Zeros 0
Zeros (%) 0.0%
Memory size 3.8 MiB

Quantile statistics

Minimum 6
5-th percentile 45
Q1 70
median 80
Q3 85
95-th percentile 95
Maximum 100
Range 94
Interquartile range (IQR) 15

Descriptive statistics

Standard deviation 14.93771709
Coefficient of variation (CV) 0.1972999103
Kurtosis 1.531392305
Mean 75.71071406
Median Absolute Deviation (MAD) 11.05492102
Skewness -1.140481569
Sum 37865048
Variance 223.1353918
Histogram
Histogram with fixed size bins (bins=10)
Value Count Frequency (%)  
80 122496 24.5%
 
95 50433 10.1%
 
90 37716 7.5%
 
75 26517 5.3%
 
79 15344 3.1%
 
78 12042 2.4%
 
77 10462 2.1%
 
74 10223 2.0%
 
70 10156 2.0%
 
85 9052 1.8%
 
Other values (85) 195687 39.1%
 
Value Count Frequency (%)  
6 12 < 0.1%
 
7 19 < 0.1%
 
8 35 < 0.1%
 
9 29 < 0.1%
 
10 43 < 0.1%
 
Value Count Frequency (%)  
100 596 0.1%
 
99 11 < 0.1%
 
98 10 < 0.1%
 
97 5532 1.1%
 
96 125 < 0.1%
 

ORIGINAL_UPB
Real number (ℝ≥0)

Distinct count 433
Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 136493.4848
Minimum 8000
Maximum 578000
Zeros 0
Zeros (%) 0.0%
Memory size 3.8 MiB

Quantile statistics

Minimum 8000
5-th percentile 52000
Q1 89000
median 126000
Q3 176000
95-th percentile 250000
Maximum 578000
Range 570000
Interquartile range (IQR) 87000

Descriptive statistics

Standard deviation 60968.74307
Coefficient of variation (CV) 0.4466787786
Kurtosis -0.2172889233
Mean 136493.4848
Median Absolute Deviation (MAD) 49931.7113
Skewness 0.5810446598
Sum 6.8265442e+10
Variance 3717187631
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 8000. 12500. 16500. 19500. 20500. ... 424500. 426000. 458500. 462500. 578000.], "bayesian blocks" binning strategy used)
Value Count Frequency (%)  
100000 9471 1.9%
 
275000 7859 1.6%
 
240000 7655 1.5%
 
120000 6162 1.2%
 
150000 5964 1.2%
 
80000 5628 1.1%
 
200000 5542 1.1%
 
90000 5483 1.1%
 
140000 5145 1.0%
 
110000 4987 1.0%
 
Other values (423) 436241 87.2%
 
Value Count Frequency (%)  
8000 1 < 0.1%
 
9000 1 < 0.1%
 
10000 7 < 0.1%
 
11000 3 < 0.1%
 
12000 3 < 0.1%
 
Value Count Frequency (%)  
578000 1 < 0.1%
 
560000 2 < 0.1%
 
544000 1 < 0.1%
 
529000 4 < 0.1%
 
525000 1 < 0.1%
 

POSTAL_CODE
Real number (ℝ≥0)

Distinct count 893
Unique (%) 0.2%
Missing 31
Missing (%) < 0.1%
Infinite 0
Infinite (%) 0.0%
Mean 55490.85714
Minimum 600
Maximum 99900
Zeros 0
Zeros (%) 0.0%
Memory size 3.8 MiB

Quantile statistics

Minimum 600
5-th percentile 7000
Q1 30500
median 54200
Q3 85000
95-th percentile 97100
Maximum 99900
Range 99300
Interquartile range (IQR) 54500

Descriptive statistics

Standard deviation 29505.38226
Coefficient of variation (CV) 0.5317161021
Kurtosis -1.240880713
Mean 55490.85714
Median Absolute Deviation (MAD) 25685.62965
Skewness -0.0841622712
Sum 2.77513106e+10
Variance 870567582.2
Histogram
Histogram with fixed size bins (bins=10)
Value Count Frequency (%)  
94500 7240 1.4%
 
85200 5775 1.2%
 
30000 5733 1.1%
 
48100 5286 1.1%
 
60000 5161 1.0%
 
48000 4347 0.9%
 
92600 4305 0.9%
 
60100 4295 0.9%
 
60600 4071 0.8%
 
75000 4035 0.8%
 
Other values (882) 449858 89.9%
 
Value Count Frequency (%)  
600 159 < 0.1%
 
700 219 < 0.1%
 
900 466 0.1%
 
1000 346 0.1%
 
1100 69 < 0.1%
 
Value Count Frequency (%)  
99900 17 < 0.1%
 
99800 68 < 0.1%
 
99700 42 < 0.1%
 
99600 139 < 0.1%
 
99500 461 0.1%
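
POSTAL_CODE is profiled as a real number, so statistics such as its mean and variance describe code values, not a measured quantity. The minimum of 600 also suggests that leading zeros were lost when the column was read as numeric. The sketch below assumes five-character postal codes and renders them as zero-padded strings. The helper `format_postal_code` is hypothetical.

```python
from typing import Optional

def format_postal_code(value: Optional[float]) -> Optional[str]:
    """Render a numeric postal code as a zero-padded 5-character string."""
    if value is None or value != value:  # value != value is True only for NaN
        return None
    return f"{int(value):05d}"

codes = [600.0, 94500.0, float("nan")]  # two values from the tables above, plus a missing one
formatted = [format_postal_code(c) for c in codes]
print(formatted)
```

Treating the column as a categorical string would also keep it out of numeric correlation calculations.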
 

PREPAID
Boolean

Distinct count 2
Unique (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 488.5 KiB
Value Count Frequency (%)  
True 480724 96.1%
 
False 19413 3.9%
 

PREPAYMENT_PENALTY_MORTGAGE_FLAG
Boolean

Distinct count 3
Unique (%) < 0.1%
Missing 5178
Missing (%) 1.0%
Memory size 3.8 MiB
Value Count Frequency (%)  
N 492669 98.5%
 
Y 2290 0.5%
 
(Missing) 5178 1.0%
 

PRODUCT_TYPE
Categorical

CONST
Distinct count 1
Unique (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 3.8 MiB
Value Count Frequency (%)  
FRM 500137 100.0%
 

Composition

Contains chars True
Contains digits False
Contains whitespace False
Contains non-words False

Length

Max length 3
Mean length 3
Min length 3

PROPERTY_STATE
Categorical

HIGH CARDINALITY
Distinct count 53
Unique (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 3.8 MiB
Value Count Frequency (%)  
CA 72566 14.5%
 
FL 30088 6.0%
 
MI 26956 5.4%
 
IL 26175 5.2%
 
TX 22786 4.6%
 
OH 20334 4.1%
 
CO 17943 3.6%
 
GA 16490 3.3%
 
AZ 16072 3.2%
 
NC 16052 3.2%
 
Other values (43) 234675 46.9%
 

Composition

Contains chars True
Contains digits False
Contains whitespace False
Contains non-words False

Length

Max length 2
Mean length 2
Min length 2

PROPERTY_TYPE
Categorical

Distinct count 7
Unique (%) < 0.1%
Missing 95
Missing (%) < 0.1%
Memory size 3.8 MiB
Value Count Frequency (%)  
SF 410630 82.1%
 
PU 53455 10.7%
 
CO 33639 6.7%
 
MH 1741 0.3%
 
CP 380 0.1%
 
LH 197 < 0.1%
 
(Missing) 95 < 0.1%
 

Composition

Contains chars True
Contains digits False
Contains whitespace False
Contains non-words False

Length

Max length 3
Mean length 2.000189948
Min length 2

SELLER_NAME
Categorical

Distinct count 48
Unique (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 3.8 MiB
Value Count Frequency (%)  
Other sellers 109360 21.9%
 
WELLSFARGOHOMEMORTGA 63768 12.8%
 
ABNAMROMTGEGROUP,INC 50543 10.1%
 
NORWEST MORTGAGE, IN 23080 4.6%
 
BANKOFAMERICA,NA 21064 4.2%
 
NATLCITYMTGECO 18303 3.7%
 
COUNTRYWIDE HOME LOA 17416 3.5%
 
NORWESTMORTGAGE,INC 17248 3.4%
 
PRINCIPALRESIDENTIAL 13603 2.7%
 
STANDARD FEDERAL BAN 11591 2.3%
 
Other values (38) 154161 30.8%
 

Composition

Contains chars True
Contains digits False
Contains whitespace True
Contains non-words True

Length

Max length 20
Mean length 17.60952099
Min length 8

SERVICER_NAME
Categorical

Distinct count 26
Unique (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 3.8 MiB
Value Count Frequency (%)  
Other servicers 94141 18.8%
 
WELLSFARGOHOMEMORTGA 86449 17.3%
 
BANKOFAMERICA,NA 42354 8.5%
 
WASHINGTONMUTUALBANK 38851 7.8%
 
ABNAMROMTGEGROUP,INC 38145 7.6%
 
CHASEMTGECO 26843 5.4%
 
NATLCITYMTGECO 22907 4.6%
 
WELLSFARGOBANK,NA 22888 4.6%
 
COUNTRYWIDE 18494 3.7%
 
PRINCIPALRESIDENTIAL 14962 3.0%
 
Other values (16) 94103 18.8%
 

Composition

Contains chars True
Contains digits False
Contains whitespace True
Contains non-words True

Length

Max length 20
Mean length 16.92317705
Min length 8

Correlations

Missing values

Sample

First rows

CHANNEL CREDIT_SCORE DELINQUENT FIRST_PAYMENT_DATE FIRST_TIME_HOMEBUYER_FLAG LOAN_PURPOSE LOAN_SEQUENCE_NUMBER MATURITY_DATE METROPOLITAN_STATISTICAL_AREA MORTGAGE_INSURANCE_PERCENTAGE NUMBER_OF_BORROWERS NUMBER_OF_UNITS OCCUPANCY_STATUS ORIGINAL_COMBINED_LOAN_TO_VALUE ORIGINAL_DEBT_TO_INCOME_RATIO ORIGINAL_INTEREST_RATE ORIGINAL_LOAN_TERM ORIGINAL_LOAN_TO_VALUE ORIGINAL_UPB POSTAL_CODE PREPAID PREPAYMENT_PENALTY_MORTGAGE_FLAG PRODUCT_TYPE PROPERTY_STATE PROPERTY_TYPE SELLER_NAME SERVICER_NAME
0 R 669.0 False 200206 N P F199Q1000004 202901 NaN 0.0 2.0 1.0 O 80.0 33.0 7.120 320 80.0 162000 26100.0 True N FRM WV SF Other sellers Other servicers
1 R 732.0 False 199904 N N F199Q1000005 202903 17140.0 0.0 1.0 1.0 O 25.0 10.0 6.500 360 25.0 53000 45200.0 True N FRM OH SF Other sellers Other servicers
2 R 679.0 False 200208 N P F199Q1000007 202902 15940.0 30.0 1.0 1.0 O 91.0 48.0 6.750 319 91.0 133000 44700.0 True N FRM OH SF Other sellers Other servicers
3 T 721.0 False 200209 N N F199Q1000013 202902 38060.0 0.0 2.0 1.0 O 39.0 13.0 6.625 318 39.0 174000 85200.0 True N FRM AZ SF Other sellers Other servicers
4 R 618.0 False 200210 N N F199Q1000015 202902 10420.0 25.0 2.0 1.0 O 85.0 24.0 6.375 317 85.0 122000 44200.0 True N FRM OH SF Other sellers Other servicers
5 R 738.0 False 200211 N P F199Q1000016 202903 10420.0 0.0 2.0 1.0 O 73.0 44.0 6.000 317 73.0 218000 44300.0 True N FRM OH SF Other sellers Other servicers
6 R 761.0 False 200211 N P F199Q1000017 202904 NaN 0.0 2.0 1.0 O 73.0 31.0 6.375 318 73.0 138000 29500.0 True N FRM SC PU Other sellers Other servicers
7 R 707.0 False 200211 N C F199Q1000018 202903 33340.0 0.0 2.0 1.0 O 60.0 57.0 6.250 317 60.0 136000 53000.0 True N FRM WI SF Other sellers Other servicers
8 R 760.0 False 200211 N N F199Q1000019 202903 33340.0 0.0 2.0 1.0 O 63.0 30.0 6.125 317 63.0 79000 53000.0 True N FRM WI SF Other sellers Other servicers
9 R 691.0 False 200302 N P F199Q1000023 202901 15940.0 0.0 2.0 1.0 O 65.0 25.0 5.875 312 65.0 130000 44700.0 True N FRM OH SF Other sellers Other servicers

Last rows

CHANNEL CREDIT_SCORE DELINQUENT FIRST_PAYMENT_DATE FIRST_TIME_HOMEBUYER_FLAG LOAN_PURPOSE LOAN_SEQUENCE_NUMBER MATURITY_DATE METROPOLITAN_STATISTICAL_AREA MORTGAGE_INSURANCE_PERCENTAGE NUMBER_OF_BORROWERS NUMBER_OF_UNITS OCCUPANCY_STATUS ORIGINAL_COMBINED_LOAN_TO_VALUE ORIGINAL_DEBT_TO_INCOME_RATIO ORIGINAL_INTEREST_RATE ORIGINAL_LOAN_TERM ORIGINAL_LOAN_TO_VALUE ORIGINAL_UPB POSTAL_CODE PREPAID PREPAYMENT_PENALTY_MORTGAGE_FLAG PRODUCT_TYPE PROPERTY_STATE PROPERTY_TYPE SELLER_NAME SERVICER_NAME
500127 R 754.0 False 200203 NaN N F102Q1125969 203202 15180.0 0.0 2.0 1.0 O 68.0 31.0 6.625 360 68.0 40000 78500.0 True N FRM TX SF WELLSFARGOHOMEMORTGA WELLSFARGOBANK,NA
500128 R 744.0 False 200203 NaN N F102Q1125972 203202 NaN 0.0 1.0 1.0 O 74.0 37.0 6.625 360 74.0 66000 75100.0 True N FRM TX SF WELLSFARGOHOMEMORTGA WELLSFARGOBANK,NA
500129 R 722.0 False 200203 NaN N F102Q1125977 203202 49020.0 0.0 1.0 1.0 O 89.0 21.0 6.625 360 79.0 78000 22600.0 True N FRM VA SF WELLSFARGOHOMEMORTGA WELLSFARGOBANK,NA
500130 R 673.0 True 200203 NaN N F102Q1125982 203202 16740.0 0.0 2.0 1.0 O 55.0 35.0 6.625 360 55.0 80000 28200.0 False N FRM NC SF WELLSFARGOHOMEMORTGA WELLSFARGOBANK,NA
500131 R 774.0 False 200203 NaN N F102Q1125985 203202 19380.0 0.0 2.0 1.0 O 57.0 15.0 6.625 360 57.0 59000 45400.0 True N FRM OH SF WELLSFARGOHOMEMORTGA WELLSFARGOBANK,NA
500132 R 774.0 False 200203 NaN C F102Q1125986 203202 33460.0 0.0 1.0 1.0 O 61.0 38.0 6.625 360 61.0 76000 55400.0 True N FRM MN SF WELLSFARGOHOMEMORTGA WELLSFARGOBANK,NA
500133 R 689.0 False 200203 NaN N F102Q1125989 203202 10580.0 0.0 1.0 1.0 O 70.0 39.0 6.625 360 70.0 70000 12300.0 True N FRM NY SF WELLSFARGOHOMEMORTGA WELLSFARGOHOMEMORTGA
500134 R 798.0 False 200203 NaN C F102Q1125990 203202 19780.0 0.0 1.0 1.0 O 56.0 41.0 6.625 360 56.0 65000 50300.0 True N FRM IA SF WELLSFARGOHOMEMORTGA WELLSFARGOBANK,NA
500135 R 791.0 False 200203 NaN N F102Q1125991 203202 42044.0 0.0 1.0 1.0 O 26.0 18.0 6.625 360 26.0 51000 92600.0 True N FRM CA SF WELLSFARGOHOMEMORTGA WELLSFARGOBANK,NA
500136 T 773.0 False 200203 NaN N F102Q1125993 203202 NaN 0.0 1.0 1.0 O 33.0 48.0 6.625 360 33.0 82000 33000.0 True N FRM FL SF WELLSFARGOHOMEMORTGA WELLSFARGOHOMEMORTGA